The most probable string: an algorithmic study

نویسندگان

  • Colin de la Higuera
  • José Oncina
چکیده

The problem of finding the consensus (most probable string) for a distribution generated by a weighted finite automaton or a probabilistic grammar is related to a number of important questions: computing the distance between two distributions or finding the best translation (the most probable one) given a probabilistic finite state transducer. The problem is undecidable with general weights and is NP-hard if the automaton is probabilistic. We give a pseudo-polynomial algorithm that solves a decision problem directly associated with the consensus string and answers if there is a (reasonably short) string whose probability is larger than a given bound in time polynomial in the the size of this bound, both for probabilistic finite automata and probabilistic context-free grammars. We also study a randomized algorithm solving the same problem. Finally, we report links between the length of the consensus string and the probability of this string.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finding the Most Probable String and the Consensus String: an Algorithmic Study

The problem of finding the most probable string for a distribution generated by a weighted finite automaton or a probabilistic grammar is related to a number of important questions: computing the distance between two distributions or finding the best translation (the most probable one) given a probabilistic finite state transducer. The problem is undecidable with general weights and is NP-hard ...

متن کامل

Bézier curve string method for the study of rare events in complex chemical systems.

We present a new string method for finding the most probable transition pathway and optimal reaction coordinate in complex chemical systems. Our approach evolves an analytic parametric curve, known as a Bézier curve, to the most probable transition path between metastable regions in configuration space. In addition, we demonstrate that the geometric properties of the Bézier curve can be used to...

متن کامل

Computing the Most Probable String with a Probabilistic Finite State Machine

The problem of finding the consensus / most probable string for a distribution generated by a probabilistic finite automaton or a hidden Markov model arises in a number of natural language processing tasks: it has to be solved in several transducer related tasks like optimal decoding in speech, or finding the most probable translation of an input sentence. We provide an algorithm which solves t...

متن کامل

Parallel Genetic Algorithm Using Algorithmic Skeleton

Algorithmic skeleton has received attention as an efficient method of parallel programming in recent years. Using the method, the programmer can implement parallel programs easily. In this study, a set of efficient algorithmic skeletons is introduced for use in implementing parallel genetic algorithm (PGA).A performance modelis derived for each skeleton that makes the comparison of skeletons po...

متن کامل

Numerical Study for the Nucleation of One-dimensional Stochastic Cahn-hilliard Dynamics

Abstract. We consider the nucleation of one-dimensional stochastic Cahn-Hilliard dynamics with the standard double well potential. We design the string method for computing the most probable transition path in the zero temperature limit based on large deviation theory. We derive the nucleation rate formula for the stochastic Cahn-Hilliard dynamics through finite dimensional discretization. We a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Log. Comput.

دوره 24  شماره 

صفحات  -

تاریخ انتشار 2014